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Abstract. Reputation systems seek to infer which members of a community can 
be trusted based on ratings they issue about each other. We construct a Bayesian 
inference model and simulate approximate estimates using belief propagation 
(BP). The model is then mapped onto computing equilibrium properties of a 
spin glass in a random field and analyzed by employing the replica symmetric 
cavity approach. The theoretical performance in terms of estimation error and 
the robustness of the BP approximation in scenarios involving different topologies, 
parameter mismatches and targeted attacks are evaluated. Having the fraction 
of trustful nodes and environment noise level as control parameters, we also 
calculate theoretical phase diagrams. These diagrams are fully confirmed by 
simulation and show regions of degraded performance explained by the emergence 
of a ferromagnetic ordered spin glass phase. 
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1. Introduction 

Ad-hoc [1] and wireless sensor networks [2] work in the absence of a central authority 
and are increasingly pervasive in modern computer systems. The secure operation 
of these autonomous networks depends on the capability of establishing trust among 
network entities. In general it is reasonable to assume that reputation and trust are 
positively correlated quantities and then employ a mutual scoring system as a source 
of data that can be used to estimate reputations [3, 4, 5]. 

Here we are concerned with the part of a reputation system [6] that identifies 
ill-intentioned individuals or malfunctioning devices by estimating reputations. This 
task would be trivial if the scores provided a reliable representation for the reputation 
of an entity. Instead, evaluation mistakes may happen or misleading ratings may be 
issued on purpose [7]. 

Reputation systems are particularly prone to attacks by malicious entities which 
can corrupt the recommendation process [8, 9, 10]. This happens for instance when 
multiple entities conspire to emit negative ratings about well-intentioned agents while 
emitting positive ratings about co-conspirators. In another form of attack known 
as a Sybil attack, a single entity could impersonate others and trick the reputation 
mechanism. 

The simplest algorithms employed by online communities use average ratings to 
determine reputations. Despite having the advantage of being easy to understand, 
these algorithms do not take into account the possibility of entities committing 
mistakes or acting deceitfully, what often leads to inferior results. More sophisticated 
algorithms employ Bayesian inference [11] or fuzzy logic [12]. Recently, iterative 
formulas over looped or arbitrarily long chains - the so called flow models - have also 
been proposed (e.g. the PageRank algorithm [13]). For a more thorough exposition 
of the range of techniques employed we suggest recent reviews such as [3, 10]. 

In this paper, we employ statistical mechanics techniques to study the 
performance of a belief propagation algorithm to approximately estimate reputations. 
This analysis provides insights into the general structure of the inference problem and 
suggests improvement directions. 

The material is organized as follows. In section 2 we introduce the inference 
model, an algorithm for estimating reputations and a performance measure. In 
section 3 we simulate the algorithm and discuss the results. A theoretical analysis 
is presented in section 4 and phase diagrams are calculated. In section 5 we discuss 
the algorithm robustness by analyzing attacks, parameter mismatches and different 
topologies. Finally, conclusions are provided in section G. 

2. Estimating Reputations 

We model a reputation system along the lines of [14, 15]. An entity i = 1, • • • , n has a 
reputation r^, and issues ratings Jjj about other entities j = 1, • • • , n with j ^ i. We 
define a set of ordered pairs that contains if the rating that i issues about j is 
present. We assume that ratings and reputations arc related by a given function 

J = /(r,tf}), (1) 

where {£} is a set of random variables representing externalities as, for example, 
uncertainties affecting opinion formation or transmission. A model is defined by 
specifying the domains of and Jij, the distribution of {£} and the function /. 



Statistical mechanics of reputation systems in autonomous networks 



3 



A good model should be able to describe realistic scenarios while remaining 
amenable to analytical treatment. Ratings should represent true reputations, namely, 
Jij oc rj. The model should also take into account emitter reputations, as we have 
to consider that an unreliable entity may emit ratings defaming well-intentioned 
individuals or groups, and that a sufficient number of such ratings can misguide the 
reputation system - the collusion phenomena depicted in figure 1 . A simple choice is 

with £y representing noise in the communication channel. We start by choosing 
ri,Jij,£ij G { — 1,1} and £y being a Bernoulli variable with parameter p, that is, 
£jj = 1 with probability p (signal level). 




Figure 1. Collusion: agents of bad reputation issue positive ratings about each 
other and misguide the mechanism. Agents with good reputation arc represented 
by white circles, and those of bad reputation by grey circles; arrow labels indicate 
whether a recommendation is positive (green) or negative (red). 

This choice of f(r, {£}) mitigates the collusion phenomena, as J.^ = 1 represents, 
in the noiseless case, either an ill-intentioned positive recommendation about an 
unreliable entity or a negative recommendation about a reliable entity issued by an 
unreliable node. By introducing £y we also build into the model misjudgmcnts and 
transmission failures. We assume a prior distribution for r supposing independence 
and a fraction q of reliable agents (reputation bias). 

Our goal is to infer r given J. For that we need the posterior distribution P(r\J) 
that can be calculated with help of Bayes theorem to find 

P(r\J) oc P(J\r)P(r) (3) 

« n p te;)II p ( r *)- 

(»,j)en i 

Notice that we can describe a Bernoulli variable with probability p for x — 1 by 
the distribution P(x) oc exp(a p :r), with a p = | log j-^-. This yields 

P(r|J)oc ; expK^Oll 

exp(a 9 r i ) (4) 

(i,j)eQ i 

oc exp a p ^2 + a i r ' 1 ( 5 ) 
V (i,j)en « / 
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oc exp 




(6) 



where a„ 



Alternatively, the dependence structure of the inference model may be represented 
by a directed graph Q — (V, E), with each vertex v £ V standing for an entity, and an 
arc being connected if and only if G fi. 

To use methods of equilibrium statistical mechanics a symmetric J is desirable 
and can be achieved by grouping terms as 



1 



where in the r.h.s. we assume that = if ^ f2. We here only consider 

undirected graphs, to say, both arcs and are in Q if G fi. By replacing 
\{Jij + Jji) for Jij, we work with Jy G { — 1, 0, 1}, discarding dissonant opinions that, 
fortunately, are few in many cases of interest [10]. The case where a single arc may be 
present can be modeled by a simple extension that considers G {—1,-^,0,^,1}. 



Given a sample of ratings {Jij} 
reputations r while keeping the error 

1 
2 



e(r, r) 



1 



our goal is to find an estimate f for the 



(7) 



as small as possible. 

A naive solution calculates reputations according to the majority of recommen- 
dations (a majority rule) 



sgn ( J ki ) . 

\kedi I 



(8) 



where di stands for the neighborhood of the vertex i in Q. Assuming that ratings are 
produced according to eq. 2 we can write 



n = sgn 



£,kirk 



(9) 



Thus we can calculate the probability of agreement between fj and the real 
reputation ?*j by considering that r k and ^ki are random variables sampled from P(r) 
and P(£) (while keeping ^ = £ki)> an d by introducing a random variable A sampled 
from the degree distribution of graph Q: 



P{r i = r i ) = ^P\J2t ik r k >o\ 



(10) 



with w = pq + (1 — p)(l — q), and ^ • denoting the average with respect to A. 

Here, £jfcr& is a Bernoulli variable with parameter w (i.e., it is +1 with probability 
w). The sum of a set of these variables is binomially distributed, and the expression 
in the r.h.s. represents the cumulative distribution. 
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Until section 5 we assume a random regular graph Q with degree c. Also, we deal 
with a scenario such that signal prevails p > 0.5 and most of the nodes are reliable 
q > 0.5. 




Figure 2. Factor graph and BP message-passing. Snapshot of a factor 
graph representing the posterior 11. Variable nodes are represented by circles, 
factor nodes are represented by small squares. At an iteration outgoing messages 
^•i-^tj are computed after combining incoming messages ti^\ fi . When convergence 
is attained we compute effective fields hi and the approximation for marginal 
posteriors P(rj). 

For an entity i the inference task consists of maximizing the posterior for r 
marginalized over every component except i. The posterior is factorizable and can 
be put in the following form 

P(r|J) = | I] eMPJijnrj)l[eMPBn), (11) 

where j3 and B are parameters which are optimally set at Nishimori's condition [16] 
j8 = a q and B = — . Factorizable distributions such as this are well represented by 
factor graphs [17, 18, 19], with variable nodes associated to the {rj} and function (or 
factor) nodes representing the functions linking them, in this case the exponentials. 
Figure 2 zooms in a factor graph representation of the posterior 11. 

Given this posterior distribution we calculate marginal distributions P(ri) = 
Y^r-^n P( r \J) efficiently by employing the message-passing scheme of belief 
propagation (BP) on the factor graph associated to the posterior eq. 11 [19]. In 
our case the outgoing BP messages (see figure 2 for illustration) are 

kedi/j 



while u^\ yi = 4 tanh 1 



are incoming messages. 



tanh(/3J fel )tanh(/3^, i ) 
Iterating this set of equations until convergence we obtain {h*_^.j , } that yields 
an approximation for marginals P{ri) oc exp(/3ft,jrj), with effective fields given by 

k=B+Y l <M- (13) 

fcedi 

This algorithm is exact on trees, but can be used in graphs of any topology, leading 
to good approximations provided that the average cycle length is large [19]. 
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After convergence of the BP scheme reputations are estimated by marginal 
posterior maximization (MPM) 



fj = argmax ri P(ri) = sgn(/i i ). 



(14) 
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Figure 3. Estimation error for the majority rule. Average errors in 
simulations (circles) are compared with theoretical errors calculated by averaging 
eq. 7 over 10 (lines). Averages are over 3000 runs, q = P(r; = 1) = 0.7 or q = 0.8. 
Graphs Q are random and regular with degree c = 3 and n = 100. Error bars are 
smaller than symbols in all three curves. 



3. Simulations 



As a basis of comparison we run the majority rule algorithm defined by eq. 8 for 3000 
scenarios with reputations r chosen randomly with reputation bias q and symmetric 
^ik with signal level p. In figure 3 we compare the average error in simulations of 
the majority rule with the theoretical error calculated by averaging eq. 7 over the 
distribution 10. The majority rule is not very far from what is used in common 
reputation systems on e-commerce websites. Note, however, that the error of this 
very simple scheme can be larger than 1 — q if the signal level p is low enough. The 
message is therefore clear: in noisy environments assigning good reputations by default 
may be actually more effective than using the majority recommendation. 

In figure 4 we compare the majority rule with the average over 3000 runs of the 
MPM estimate computed with the BP algorithm. For convenience, the algorithm is 
presented as a pseudocode in the Appendix A :f. The gains in performance when the 
collusion phenomena is built into the inference model are considerable even in very 
noisy environments. 

A detailed view of the error surface for the approximate MPM estimates in terms 
of reputation bias q and signal level p is depicted in figure 5a. Two regions emerge such 
that when the noise level is low and and most of the entities are trustful the estimation 
error is very low. Other average quantities can also be evaluated in the simulations 
in order to access the algorithm's performance. Figure 5b, for instance, shows the 
fraction of runs for which the algorithm has converged - an empirical probability of 
convergence. 

A distinctive region can be observed in figure 5b where the probability of 
convergence is very low. Other quantities we have studied, such as the average rate of 



| Source code is also available at https://github.com/amanoel/repsys. 
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Figure 4. MPM estimate versus the majority rule. Comparison between 
the average error of the majority rule (eq. 8) and of the MPM estimate (eq. 14), 
for q = P(ri = 1) = 0.7, a regular random graph with degree c = 3 and n = 100. 
For the majority rule we show theory and simulations, with symbols larger than 
error bars. For the MPM estimate we show the average over 3000 simulation runs 
with error bars representing one standard deviation. 
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Figure 5. Average error and probability of convergence: 3000 runs 
of the BP algorithm in a grid of values for signal level p and reputation bias q, 
with n = 100 and c = 3. In each run Q, {rj} and are sampled and ratings 



,} are calculated. Panel (a) depicts the average error as in eq. 



Panel (b) represents the fraction of of runs for which the BP algorithm converges. 



convergence and the overlap of solutions given different initial conditions (i.e., different 
distributions P^°\h)), corroborate the existence of this region with low performance 
due to the dynamic behavior of the algorithm iterative process. 

In the next section we use equilibrium statistical mechanics to calculate the phase 
diagram as a function of the control parameters p and q and to explain low performance 
regions. 
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4. Theoretical analysis 

The posterior distribution 6 suggests the description of the problem of inferring 
reputations in terms of the equilibrium properties of a spin glass in an external field 

H(s\J,B) = - J2 JijSiSj-B^st, ( 15 ) 
(i,j)en i 

where we designate the dynamic variable as Si, and the target variables are fixed 
(quenched). 

The MPM estimates described in section 3 correspond to fi = sgn(mi) with 
rrii = tanh(/3/ij) representing local equilibrium magnetizations. 

At Nishimori's condition [16] temperature and field are chosen as (3n = ot p and 
Bn = and the microstates of this physical system are distributed according to 
Gibbs distribution 

P(s\J) = L e -PNU{s\J,B N ) ( 16 ) 

Other values of (3 and B, corresponding to misspecified p and q are studied in the next 
section. 

We wish to calculate the equilibrium average error of eq. 7 which corresponds to 
the magnetization of the Hamiltonian in eq. 15 (gauge) transformed with r,Sj — > Sj. 
As this Hamiltonian is not gauge invariant we now have to deal with a spin glass in a 
random field 

H(s\&r,B) = - J2 ZijSiSj - Bj2 r i s i- ( 17 ) 

(ij)eo i 

Here {£y} and {n} are quenched variables with P(£,ij = = 1) = p and P(r; = 1) = 
q. The error e(r, r) in the gauge transformed variables can be written ase = i(l— m*), 
where m* = — Si is the gauge transformed equilibrium magnetization. 

In our analysis we employ the replica-symmetric cavity method along the lines of 
[19, 20]. In this section we calculate the phase diagram at Nishimori's condition. We 
first write an equation for the distribution of cavity fields for the gauge transformed 
variables as calculated by the BP procedure in eq. 13: 

P(h) = JjldhiPih^^lh-Br-^Uii^hi)] } rU} , (18) 
or, more concisely 

c-l 

fc^((Br + J>(£i.fci))) , (19) 
i=i r d?.i 

with = indicating equality in distribution. In this context, h is the cavity field, and 

Ui{&, h) = — tanlT 1 [tanh(/3^) tanh(/3/i l )] 

are cavity biases [19, 21]. Note that we work here under the assumption that it makes 
sense to describe fixed points of the BP equations 13 in terms of a unique density 
P(h). That is the replica symmetry (RS) assumption. 
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Figure 6. Theoretical error and phase diagram at Nishimori's 
condition: (a) theoretical error e = ^ ^1 — (sgn(/i)) for different values of 

p and q; (b) phase diagram for the studied system, in which three different 
phases can be distinguished: ferromagnetic (F), for m 0, p = 0, mixed (M), 
ferromagnetically ordered spin glass, for m ^ 0, (i / and paramagnetic (P), for 
m = and p = 0. Notice that for q = 0.5 the model is gauge invariant and no 
mixed phase is found at Nishimori's condition 0n = a p and B jy = 



We calculate numerical solutions to 18 by the population dynamics algorithm 
(see Appendix B for details) and then calculate thermodynamic quantities. The 
magnetization is given by m = (tanh(/3ft,)) / - , where 



h= ({Br + ^Ui(£i,hi 



>{&} 



(20) 



is the effective field. Note that the sum here ranges from 1 to c. 

The onset of a spin glass phase can be detected by finding divergences in the spin 
glass susceptibility. Provided that the disorder is spatially homogeneous, the spin 
glass susceptibility averaged over this disorder can be written as: 



x 89 = E^)(([M-( s o)W] : 



(21) 



where sq is a variable at an arbitrary central site, se is an arbitrary variable at a site 
separated from by a chemical distance I and N{i) is the number of sites at a distance 
t from 0. 

The fluctuation-dissipation theorem, the symmetry introduced by averaging and 
the BP equations 13 yield 

dm Q dho du.^ du.^i 



oo 

E 

£=0 



m) 



dh du.^ du.^e dh e 



where u.^q and u.^i represent incoming messages in a path connecting to 
sufficient condition for \ sg to diverge is, therefore, that 



lim 



du. 



> 0. 



(22) 
i. A 

(23) 
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This quantity measures the sensibility of the incoming message at a central site to a 
perturbation in a message forming at a far outside distance £. In terms of cavity field 
distributions we can write 

H<( 

A number of numerical methods can be used to evaluate p [21, 20]. Using 
population dynamics (described in Appendix B), we introduce two slightly different 
initial states such that u' [i] — u n [i] = S with i = 1, • • • ,N and 5 = 10~ 4 . After a large 
number t of iterations of the population dynamics algorithm we calculate 

/^£«M-M*]) 2 - ( 25 ) 

i 

The order parameters m and p allow the identification of four different thermodynamic 
phases: paramagnetic if p = 0, m = 0, ferromagnetic if p = 0, m ^ 0, glassy if p ^ 0, 
m = and, mixed or ferromagnetic ordered spin glass for p ^ 0, m ^ 0. 

To measure the theoretical error, we also calculate (sgn(/i))?. Discounting finite 
size effects, theoretical results depicted in figure 6a are corroborated by simulations 
depicted in figure 5a. Most notably, the mixed glassy phase identified in the phase 
diagram plotted in figure 6b is located in the same region for which the empirically 
measured algorithm's performance degenerates. Such curious behavior is due to the 
rough energy landscape of the glassy phase so that any algorithm would encounter 
difficulties in finding a high quality solution for the inference problem in this particular 
region of parameters. 

At Nishimori's condition it is relatively easy to check that for every (p, q) there 
is also a state in the simple form P FEB . RO (h) = S(h — ho), with ho depending on (p, q), 
that is an extremum of the replica symmetric free energy. By fixing (p, q) and running 
the population dynamics algorithm with different initial conditions we can check for 
the uniqueness of these solutions. Note, however, that the existence of other states 
that are also fixed points of the population dynamics with small basins of attraction 
cannot be ruled out. Nevertheless, evidence that the mixed state is thermodynamically 
dominant is provided by obtaining the same states as fixed points of the population 
dynamics for a large number of random initial conditions and by finding that the free 
energies of these states are lower than the free energy of P FERRO (h). 

Two rigorous results on phase diagrams of similar models are available: (A) if the 
Hamiltonian is gauge invariant the Nishimori line does not cross a spin glass phase 
[22] and (B) there is no spin glass phase in a random field Ising model [23]. In order 
to check our results we observe that fixing q = 1/2 (a q = 0) yields a gauge invariant 
model. The phase diagram in figure 6b and the theoretical error depicted in figure 
6a exhibit only paramagnetic and ferromagnetic phases over the line q = 0.5 and are, 
therefore, consistent with rigorous result (A). Result (B) is only relevant if we can 
choose parameters such that £y > (or p = 1) for any € f^, any random field 
and any B in the Hamiltonian 17. At Nishimori's condition, however, p = 1 implies 
that Pn = oo and Bn = 0. Thus the model is a trivial ferromagnet and result (B) 
is irrelevant. Yet if the inference model assumes p < 1 while £jj are generated with 
p = 1, rigorous result (B) forbids either a spin glass or a mixed phase to show up. 
This scenario is studied in the next section and, indeed, no mixed phase is found for 
p = 1 and p < 1 as it can be seen in figure 7d. 



du [ui(h)\ 
du£ 



(24) 
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5. Robustness 



To this point our analysis has only considered Nishimori's condition, namely, {p, q} 
employed in the inference algorithm set to be the same {p, q] used to generate the 
data. We also supposed ratings {Jij} distributed over a regular random graph of fixed 
degree c = 3 and issued exactly as assumed by the inference model. In this section, 
we relax these assumptions to access both the performance of the algorithm and the 
validity of the theoretical analysis under more general conditions. 



5.1. Parameters mismatch 

The inference algorithm has to assume (or estimate) values p and q for the signal level 
and reputation bias. Ideally these parameters have to be set to the same levels used 
to generate the data. However, the environment can change without warning and we 
would like to know how the inference scheme would perform in such circumstances. 
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(a) empirical error 
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Figure 7. Empirical and theoretical analysis for mismatched signal level 
p with reputation bias q = 0.7. Panels (a) and (c): empirical and theoretical 
error. Panels (b) and (d): probability of convergence and phase diagram. The 
performance is only sensitive to small parameter mismatches in the neighborhood 
of a phase transition as, for instance, p = 0.8 in this figure. As it is expected, 
there is no mixed phase if the true signal level is p = 1, while a different p is 
employed in the inference model. 

By simulation we can generate the vector r and the symmetric matrix £ as 
Bernoulli variables with probability q and p, respectively, and then run the algorithm 
assuming ap and a.q. Analogously, in the theoretical analysis, we can just set 
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13 = ap and B = ^ while considering r.j and £j as quenched Bernoulli variables 
with parameters q and p. 

Figure 7 depicts results for both analysis in the plane p—p for q = 0.7. The 
theoretical description is still accurate. Small mismatches between p and p are in 
general well absorbed by the inference scheme, except when the real p is situated close 
to a phase transition. A quick inspection of figure 6b reveals that for q = 0.7, the 
neighborhood of p ~ 0.8 exemplifies a specially sensitive region. Performance also 
deteriorates when estimates, in this case for the signal level, are too optimistic. 




0.5 0.6 0.7 0.8 0.9 1 0.5 0.6 0.7 0.8 0.9 1 
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Figure 8. Empirical and theoretical analysis for a random graph with 
degree distribution given by eq. 26 and q = 0.7. The parameter c is the average 
node degree while p is the signal level. Panels (a) and (c) depict empirical and 
theoretical errors. Panel (b) shows the average number of iterations of the BP 
algorithm needed for convergence and panel (d) depicts the theoretical phase 
diagram. 



5.2. Graph topologies 

The set fl of ratings issued by network entities define a graph Q with each vertex 
representing an entity and an edge being connected if and only if G 0. 

The theoretical analysis based on the replica-symmetric cavity equation 18 relies on 
specifying an ensemble of graphs represented by a particular degree distribution (or 
profile). In the previous discussion we have used an ensemble of regular random graphs 
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with a degree profile given by A c (7) = S (7 — c). A natural extension to that is allowing 
fractional values for c, for that we introduce: 

f l-(c-LcJ) for 7 =LcJ, 
A c ( 7 ) = < c-LcJ for 7 =LcJ+l, (26) 

I otherwise. 

In this way for c = 2.3 we would have 30% of the agents with degree 3 and 70% with 
degree 2. 

The empirical analysis is done by simulating instances sampled from this ensemble 
of graphs. The left panel in figure 8 shows the theoretical and empirical average error 
surfaces in the plane p-c for q = 0.7. The right panel compares another measure 
of performance - the average number of iterations the BP algorithm needs to attain 
convergence in simulations - and the theoretical phase diagram. 
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Figure 9. (Color online) Results for empirical analysis performed over a square 
lattice, for q = 0.7 and different values of p. Despite the short cycle lengths, the 
algorithm presents a good performance. 



The BP algorithm calculates exact marginals and allows for optimal Bayesian 
inference when the subjacent graph is a tree. The performance of the BP algorithm, 
however, can be studied by simulation on any topology. Figure 9 shows the resulting 
performance measures as a function of p for q = 0.7 and for Q chosen to be a square 
lattice in two dimensions. The average error is always smaller than 1 — q and vanishes 
as p — > 1 showing that even in this case the BP algorithm may yield good results. Also 
as we tune p a glassy phase appears where the probability of convergence decreases 
and the average number of iterations needed to attain convergence increases steeply. 



5.3. Attacks 

Malicious entities may issue ill-intentioned ratings to trick the reputation system 
and malfunctioning devices may issue erroneous ratings. These scenarios of targeted 
attacks can also be considered by the inference model and studied within the same 
theoretical framework and simulation techniques. 

We suppose that a fraction / of the ratings are issued by noisy entities while the 
inference process remains unchanged. 

Lets call the subset of noisy ratings 0/ C O. To simulate this scenario, we 
uniformly sample and fix a fraction / of the entities to be noisy, issuing ratings as 
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Jij = Viji where n^ = rjji is a Bernoulli variable with parameter z — i and G f2/. 
We then run the BP algorithm and study how the performance changes as the value 
of / grows. 




Figure 10. Factor graph for the posterior 5.3. A fraction / of the function nodes 
represent noisy ratings (red squares). 

For the theoretical analysis we calculate performance measures averaged over 
every disorder component: regular random graph with c = 3 in our example, 
symmetric communication noise £, symmetric random ratings r\ and symmetric ratings 
J. As we are interested in checking algorithm robustness, we also assume that the 
inference scheme has no knowledge that the reputation system is under attack. 

We first rewrite the posterior by taking into account noisy ratings: 

P(r|J)cx J] P(£«) J] P( % -)Ip(n). (27) 
Following the previous steps yields: 



P(r\J) oc exp 



op [ J2 7 - • — E ? to + /E r ' 

di,j)en\si f p p i 



Since the algorithm considers the ratings ilf as subject to the same 
communication noise as regular ratings, we have a z = a p . The gauge transformed 
Hamiltonian for an equilibrium statistical mechanics description is 

H(s) = - &3 s i s J- '/,.,r, /%>,>, /i^ (28) 

There is a new term — r ]ij r i r j s i s j m the Hamiltonian, which can 

be treated by the inclusion of a new type of function node to the factor graph 
representation for the posterior 5.3. Figure 10 provides a snapshot of this factor 
graph. The replica symmetric cavity description can then be written as 

U((sr + £t*(/0)) (29) 

i— 1 

where Ui(h) = ^ tanh -1 [tanh(/3£j) tanh(/3/ij)] with probability 1 — /, and with 
probability / 

Ui(h) = — tanh -1 [taxih(firiirri) tanh(^Zij)] , 
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Figure 11. Fraction / of agents issues random ratings. Panel (a) and 
(b) depict simulation results for the error and for the probability of convergence. 
Panels (c) and (d) show theoretical error and phase diagram using the replica 
symmetric cavity approach. As expected, the error increases with /. 



where r\i is a Bernoulli variable with P{j]i = 1) = |, r is the same used in eq. 29 and 
the {ri} are independently sampled from r. 

Figure 11 depicts simulation results for the error in Panel (a) and for the 
probability of convergence in Panel (b). Panels (c) and (d) show theoretical error 
and phase diagram using the replica symmetric cavity approach. 



6. Conclusions 



The use of a belief propagation (BP) algorithm for approximate inference in reputation 
systems has been introduced in [24, 25] . We here extend previous work by calculating 
performance measures using the replica symmetric cavity approach after expressing 
the inference problem in terms of equilibrium statistical mechanics. We apply this 
framework to a basic scenario and to three simple variations. 

The framework is very general and allows for the study of the algorithm 
performance when subjected to several scenarios of practical interest such as the 
presence of collusions, parameter mismatches and targeted attacks. Other questions 
of practical interest remain. Algorithms based on BP approximate inference seem 
to represent an interesting alternative for reputation systems such as wireless sensor 
networks, however implementation details that have been purposefully ignored in our 
analysis certainly deserve a more thorough analysis. For instance, sensors often 
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operate with very limited resources; the sampling of ratings and running of the 
algorithm should be scheduled taking these limitations into account. Also, in a 
distributed scheme, it is necessary to specify how each sensor will build the matrix J 
from the available information. 

From a theoretical point of view reinforced belief propagation or survey 
propagation techniques promise better results in the deteriorated performance glassy 
phase. Also, expectation maximization-belief propagation [26] could allow the 
algorithm to run without the need of supplying signal level p and reputation bias 
q as inputs. For scenarios involving targeted attacks more information could be built 
in the rating mechanism that may allow for improved inference algorithms. 
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Appendix A. Computation of marginals using belief propagation 

The algorithm takes as input a distribution P(h) from which the messages are initially 
sampled, a maximum to the number of iterations i max , a precision e for convergence 
and estimated values of p and q, {p, q}. In what follows, we have used pW(h) = 5(h), 
t max ~ 100, e ~ 10~ 7 and {p, q} = {p, q} — this last condition is later relaxed. The 
complete pseudocode is as follows: 

Input: P(°\h), i max , e, {p,q}; {Jy}, Q 
Output: {ft} 

1: initialize sampling from p(°\h) 

2: j3 aip, B <— aq/otp 

3: while A > e and t < t max do 

4: for i = 1 — » n, j E di do 

5: K^j = B + J2kedt/j ( J ki j hk^i ) 
6: end for 

7: A <- max | h'^j - hi^j \ 

8: t <-* + l, {hi^j} <- {h^j} 

9: end while 

10: if A < e then 

11: for i = 1 to n do 

12: hi = B + Y^kedi Uk^i(Jki, h k ^i) 

13: fi = sgn(ftj) 

14: end for 

15: end if 
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Appendix B. Population dynamics 

The population dynamics algorithm provides an approximated solution to 19 by 
iterating 



uf ] = i tanh" 1 |tanh(/?£) tanh(/3/if ) ) 
h\ =Br + J2 J = 1 u)'. 



(B.l) 



We introduce two arrays of length N = 10 4 : h and u. At the first step, the 
elements of u are initialized. We have considered two possible ways of initializing 
u: by uniformly sampling from [—£,£], e = 10~ 2 , or by assigning uf 1 ' = £j (i.e., 
tanh(/3/i^ ) = 1); the results obtained in our analysis were very similar for both. For 
discussions regarding the use of different initial conditions, the reader may refer to 
[21, 20]. 

Next, the elements of h arc updated according to the rule, with {uj} uniformly 
sampled from u and r sampled from P(r); and the elements of u are calculated from 
the respective element in h and £ sampled from P(£). The process is repeated r — 3000 
times. After this large number of iterations, the array h should be approximately 
distributed as the real distribution P(h), and we are then able to calculate the desired 
averages. 

In order to calculate m = (tanh(j3h)j . , we may introduce an array h with the 

same length N, and since h is given by a sum with an extra term, the array elements 
are computed by simply summing the elements of h with some uniformly sampled 
element of u. We then have m ~ 4= Yli tauh(h[i]). 



References 



[1] M. Barbeau and E. Kranakis. Principles of Ad Hoc Networking. Wiley, 2007. 
[2] W. Dargie and C. Poellabauer. Fundamentals of Wireless Sensor Networks: Theory and 
Practice. Wiley, 2010. 

[3] J. Sabater and C. Sierra. Review on Computational Trust and Reputation Models. Artificial 

Intelligence Review, 24(l):33-60, September 2005. 
[4] L. Mui. Computational Models of Trust and Reputation: Agents, Evolutionary Games, and 

Social Networks. PhD thesis, Massachusetts Institute of Technology, 2002. 
[5] E. Buchmann. Trust Mechanisms and Reputation Systems. In Dorothea Wagner and Roger 

Wattenhofer, editors, Algorithms for Sensor and Ad Hoc Networks, pages 325-336. Springer, 

Berlin, 2007. 

[6] F. G. Marmol and G. M. Perez. Towards pre-standardization of trust and reputation models for 
distributed and heterogeneous systems. Computer Standards & Interfaces, 32(4):185-196, 
June 2010. 

[7] A. Josang and J. Golbeck. Challenges for robust of trust and reputation systems. In Proceedings 
of the 5th International Workshop on Security and Trust Management, number September, 
2009. 

[8] F. G. Marmol and G. M. Perez. Security threats scenarios in trust and reputation models for 

distributed systems. Computers & Security, 28(7):545— 556, October 2009. 
[9] K. Hoffman, D. Zage, and C. Nita-Rotaru. A survey of attack and defense techniques for 

reputation systems. ACM Computing Surveys, 42(1), December 2009. 
[10] A. j0sang, R. Ismail, and C. Boyd. A survey of trust and reputation systems for online service 

provision. Decision Support Systems, 43(2):618-644, March 2007. 
[11] L. Mui, A. Halberstadt, and M. Mohtashemi. Notions of Reputation in Multi-Agents Systems: 

A Review. In Proceedings of AAMAS '02, pages 280-287, New York, USA, 2002. ACM Press. 
[12] J. Sabater and C. Sierra. REGRET: reputation in gregarious societies. In Proceedings of 

AGENTS '01, pages 194-195, New York, USA, 2001. ACM Press. 



Statistical mechanics of reputation systems in autonomous networks 



18 



[13] L. Page, S. Brin, R. Motwani, and T. Winograd. The PagcRank Citation Ranking: Bringing 

Order To The Web. Technical report, Stanford University, 1998. 
[14] S. Ermon, L. Schenato, and S. Zampieri. Trust Estimation in autonomic networks: a statistical 

mechanics approach. In Proceedings of the 48th IEEE Conference on Decision and Control 

(CDC), pages 4790-4795, December 2009. 
[15] T. Jiang and J. S. Baras. Trust evaluation in anarchy: A case study on autonomous networks. 

In Proceedings of IEEE Infocom, 2006. 
[16] Y. Iba. The Nishimori line and Bayesian statistics. J. Phys. A: Math. Gen., (32):3875-3888, 

1999. 

[17] F.R. Kschischang, B.J. Frey, and H.-A. Loeliger. Factor graphs and the sum-product algorithm. 

Information Theory, IEEE Transactions on, 47(2):498 -519, feb 2001. 
[18] H.-A. Locligcr. IEEE Signal Processing Magazine, January 2004. 

[19] M. Mezard and A. Montanari. Information, Physics, and Computation. Oxford University 
Press, USA, 2009. 

[20] Y. Matsuda, H. Nishimori, L. Zdeborova, and F. Krzakala. Random-field p-spin-glass model on 
regular random graphs. Journal of Physics A: Mathematical and Theoretical, 44(18):185002, 
May 2011. 

[21] L. Zdeborova. Statistical physics of hard optimization problems. Acta Physica Slovaca. Reviews 

and Tutorials, 59(3):169-303, June 2009. 
[22] H. Nishimori. Statistical Physics of Spin Glasses and Information Processing. Oxford University 

Press, 2001. 

[23] F. Krzakala, F. Ricci-Tersenghi, and L. Zdeborova. Elusive Spin-glass Phase in the Random 

Field Ising Model. Physical Review Letters, 104:207208, 2010. 
[24] S. Ermon. Trust Estimation in autonomic networks: a message passing approach. In Proceedings 

of NECSYS, Venice, 2009, 2009. 
[25] E. Ayday. Application of belief propagation to trust and reputation management. In Proceedings 

of IEEE International Symposium on Information Theory (ISIT), pages 2173-2177, July 

2011. 

[26] A. Decelle, F. Krzakala, C. Moore, and L. Zdeborova. Asymptotic analysis of the stochastic block 
model for modular networks and its algorithmic applications. Physical Review E, 84(6):1— 25, 
December 2011. 



